Mouse Dnmt3a DNA sequence
1 GAATTCCGGC CTGCTCCCGG GCCGCCCGAC CCGCCGGGCC ACACGGCAGA 
51 GCCGCCTGAA GCCCAGCGCT GAGGCTGCAC TTTTCCGAGG GCTTGACATC 
101 AGGGTCTATG TTTAAGTCTT AGCTCTTGCI TACAAAGACC ACGGCAATTC 
151 CTTCTCTGAA GCCCTOGCAG CCCCACAGCG CCCTCGCAGC CCCAGCCTGC 
201 CGCCTACTGC CCAGCAATGC CCTCCAGCGG CCCOGGGGAC ACCAGCAGCT 
251 CCTCTCTGGA GCGGGAGGAT GATCGAAAGG AAGGAGAGGA ACAGGAGGAG 
301 AACCGTGGCA AGGAAGAGCG CCAGGAGCCC AGCGCCACGG CCCGGAAGGT 
351 GGGGAOGCCT GGCCGGAAGC GCAAGCACCC ACCGGTGGAA AGCAGTGACA 
401 CCCCCAAGGA CCCAGCAGTG ACCACCAAGT CTCAGGCCAT GGCCCAGGAC 
451 TCTGGCCCCT CAGATCTGCT ACCCAATGGA GACTTGGAGA AGCGGAGTGA
501 ACCCCAACCT GAGGA^GGGA GCCCAGCTGC AGGGCAGAAG GGTGGGGCCC 
551 CAGCTGAAGG AGAGGGAACT GAGACCCCAC CAGAAGCCTC CAGAGCTGTG 
601 GAGAATGGCT GCTGTGTGAC CAAGGAAGGC OCTGGAGCCT CTGCAGGAGA 
651 GGGCAAAGAA CAGAAGCAGA CCAACATCGA ATCCATGAAA ATGGAGGGCT 
701 CCCGGGGCCG ACTGCGAGGT GGCTTGGGCT GGGAGTCCAG CCTCCGTCAG 
751 CGACCCATGC CAAGACTCAC CTTCCAGGCA GGGGACCCCT ACTACATCAG 
801 CAAACGGAAA CGGGATGAGT GGCTGGCACG TTGGAAAAGG GATGCTGAGA 
851 AGAAAGCCAA GGTMITGCA GTAATGAATG CTGTGGAAGA GAACCAGGCC 
901 TCTGGAGAG7 CTCAGAAGGT GGAGGAGGCC AGCCCTCCTG CTGTGCAGCA 
951 GCCCACGGAC CCTGCTTCTC CGACTGTGGC CACCACCCCr GAGCCAGTAG 
1001 GAGGGGATGC TGGGGACAAC AATGCTACCA AAGCACCCGA CGATGAGCCT 
1051 GAGTATGAGG ATGGCCGGGG CTTTGGCATT GGAGAGCTGG TGTGGGGGAA
1101 ACTTCGGGGT TTCTCTTGGT GGCCAGGCCG AATTGTGTCT TGGTGGATGA

FIG. 1A-1
1151 CAGGCCGGAG CCGAGCAGCT GAAGGCACTC GCTGGGTCAT GTGGTTCGGA
1201 GATGGCAAGT TCTCAGTGGT GTGTGTGGAG AAGCTCATGC CGCTGAGCTC 
1251 CTTCTGCAGT GCATTCCACC AGGCCACCTA CAACAAGCAG CCCATGTACC 

1301 GCAAAGCCAT CTACGAAGTC CTCCAGGTGG CCAGCAGCCG TGCCGGGAAG 

1351 CTGTTTCCAG CITGCCATGA CAGTGATGAA AGTGACAGTG GCAAGGCTGT 

1401 GGAAGTGCAG AACAAGCAGA TGATTGAATG GGCCCTCGGT GGCTTCCAGC 

1451 CCTCGGGTCC TAAGGGCCTG GAGCCACCAG AAGAAGAGAA GAATCCTTAC 

1501 AAGGAAGTTT ACACCGACAT GTGGGTGGAG CCICAAGCAG CTGCTTACGC

1551 CCCACCCCCA CCAGCCAAGA AACCCAGAAA GAGCACAACA GAGAAACCTA 

1601 AGGTCAAGGA GATCATTGAT GAGCGCACAA GGGAGCGGCT GGTGTATGAG 

1651 GTGCGCCAGA AGTGCAGAAA CATCGAGGAC ATTTGTATCT CATGTGGGAG 

1701 CCTCAATGTC ACCCTGGAGC ACCCATTCTT CATTGGAGGC ATGTGCCAGA 

1751 ACTCTAAGAA CTGCTTCTTG GAGTGTGCTT ACCAGTATGA CGACGATGGC 

1801 TACCAGTCCT ATTGCACCAT CTGCTC7GGG GGGCGTGAAG TGCTCATGTG 

1851 TGGGAACAAC AACTGCTGCA GGTGCTTTTG TGTCGAGTGT GTGGATCTCT

1901 TGGTGGGGCC AGCAGCTGCT CAGGCAGCCA TTAAGGAAGA CCCCIGGAAC 

1951 TGCTACATGT GCGGGCATAA GGGCACCTAT GGGCTGCTGC GAAGACGGGA 

2001 AGACTGGCCT TCTCGACTCC AGATGTTCTT TGGCAATAAC CATGACCAGG 

2051 AATTTGACCC CCCAAAGGTT TACCCACCIG TGCCAGCTGA GAAGAGGAAG 

2101 CCCATCCGCG TGCTGTCTCT CTTTGATGGG ATTGCTACAG GGCTCCTGGT 

2151 GCTGAAGGAC CTGGGCATCC AAGTGGACCG CTACATTGCC TCCGAGGTGT 

2201 CTGAGGACTC CATCACGGTG GGCATGGTCC GGCACCAGGG AAAGATCATG 

2251 TACGTCGGGG ACCTCCGCAG CGTCACACAG AAGCATATCC AGGAGTGGGG 

2301 CCCATTcGAC cTGGTGATTG GAGGCAGTCC CTGCAATGAC CTcTCCATTG

FIG. 1A-2
2351 TCAACCCTGC CCGCAAGGGA CTTTATGAGG GTACTGGCOG CCTCTTCTTT

2401 GAGTTCTACC GCCTCCTGcA TGATGCGCGG CCCAAGGAGG GAGATGATCC 

2451 CCCCTTCTTC TGGCTCTTTG AGAATGTGGT GGCCATGGGC GTTAGTGACA 

2501 AGAGGGACAT CTCGCGATTT CTTGAGTCTA ACCCCGTGAI GATTGACGCC 

2551 AAAGAAGTGT CTGCTGCACA CAGGGCCCGT TACTTCTGGG GTAACCTTCC 

2601 TGGCATGAAC AGGCCTTTGG CATCCACTCT GAATGATAAG CTGGAGCTGC

2651 AAGAGTGTCT GGAGCACGGC AGMTAGCCA AGTTCAGCAA AGTGAGGACC 

2701 ATTACCACCA GGTCAAACTC TATAAAGCAG GGCAAAGACC AGCATTTCCC 

2751 CGTCTfCATG AACGAGAAGG AGGACATCCT GTGGTGCACT GAAATGGAM 

2801 GGGTGTTTGG CTTCCCCGTC CACTACACAG ACGTCTCCAA CATGAGCCGC 

2851 TTGGCGAGGC AGAGACTGCT GGGCCGATCG TGGAGCGTGC COGTCATCCG 

2901 CCACCTCTTC GCTCCGCTGA AGGAATATTT TGCTTGTGTG TAAGGGACAT 

2951 GGGGGCAAAC TGAAGTAGTG ATGATAAAAA AGTTAAACAA ACAAACAAAC 

3001 AAAAAACAAA ACAAAACAAT AAAACACCAA GAAGGAGAGG ACGGAGAAAA 

3051 GTTCACCACC CAGAAGAGAA AAAGGAATTT AAAGCAAACC ACAGAGGAGG 

3101 AAAACGCCGG AGGGCTTGGC CTTGCAAAAG GGTTGGACAT CATCTCCTGA 

3151 GTTTTCAATG TTAAGCTTCA GTCCTATCTA AAAAGCAAAA TAGGCCCCTC 

3201 CCCTTCTTCC CCTCQSGTCC TAGGAGGCGA ACTTTTTGTT TTCTACTCTT 

3251 TTTCAGAGGG GTTTTCTGTT TGTTTGGGTT TTTGTTTCTT GCTGTGACTG 

3301 AAACAAGAGA GTTATTGCAG CAAAATCAGT AACAACAAAA AGTAGAAATG 

3351 CCTTGGAGAG GAAAGGGAGA GAGGGAAMT TCTATAAAAA CTTAAMTAT 

3401 TGGTTTTTTT TTTTTTTCCT TTTCTATATA TCTCTTTGGT TGTCTCTAGC 

3451 CTGATCAGAT AGGAGCACAA ACAGGAAGAG AATAGAGACC CTCGGAGGCA 

3501 GAGTCTCCTC TCCCACCCCC CGAGCAGTCT CAACAGCACC ATTCCTCGTC

FIG. 1A-3
3551 ATGCAAAACA GAACCCAACT AGCAGCAGGG CGCTGAGAGA ACACCACACC

3601 AGACACTTTC TACAGTATTT CAGGTGCCTA CCACACAGGA AACCTTGAAG 

3651 AAAACCAGTT TCTAGAAGCC GCTGTTACCT CTTGTTTACA GTTTATATAT

3701 ATATGATAGA TATGAGATAT ATATATATAA AAGGTACTGT TAACTACTGT

3751 ACATCCCGAC TTCATAATGG TGCTTTCAAA ACAGCGAGAT GAGCAAAGAC

3801 ATCAGCTTCC GCCTGGCCCT CTGTGCAAAG GGTTTCAGCC CAGGATGGGG

3851 AGAGGGGAGC AGCTGGAGGG GGTTTTMCA AACTGAAGGA TGACCCATAT

3901 CACCCCCCAC CCCTCCCCCA TGCCTAGCTT CACCTGCCAA AAAGGGGCTC

3951 AGCTGAGGTG GTCGGACCCT CGGGAAGCTG AGTCTGGAAT TTATCCAGAC

4001 TCGCGTGCAA TAACCTTAGA ATATGAATCT AAAATGACTG CCTCAGAAAA 

4051 ATGGCTTGAG AAAACATTGT CCCTGATTTT GAATTCGTCA GCCACGTTGA 

4101 AGGCCCCTTG 7GGGATCAGA AATATTCCAG AGTGAGGGAA AGTGACCCGC 

4151 CATTAACCCC NCCTGGAGCA AATAAAAAAA CATACAAAAT GT 
Mouse Dnmt3b1 DNA Sequence
1 GMTTCCGGG CGCCGGGGTT AAGOGGCCCA TAAACGTA GCGCAGCGAT 
51 GGGCGCCGGA GATTCGCGAA CCCGACACTC CGCGCCCCCC GCCGGCCAGG 
101 ACOCGCGGCG CGATCGCGGC GCOGCGCTAC AGCCAGCCTC ACGACAGGCC 
151 CGC7GAGGCT TGTGOCAGAC CTTGGAAACC TCAGGTATAT ACCTTTCCAG 
201 ACGCGGGATC TCCCCTCCCC CATCCATAGT GCCTTGGGAC CAAATCCAGG 
251 GCCTTCTTTC AGGAAACAAT GAAGGGAGAC AGCAGACATC TGAATGAAGA
301 AGAGGGTGCC AGCGGGTATG AGGAGTGCAT TATCGTTAAT GGGAACTTCA 
351 GTGACCAGTC CTCAGACACG AAGGATGCTC CCTCACCCCC AGTCTTGGAG
401 GCAATCTGCA CAGAGCCAGT CTGCACACCA GAGACCAGAG CCCGCAGGTC
451 AAGCTCCCGG CTGTCTAAGA gGGAGGTCTC CAgCCTTCTG AATTACACGC
501 AGGACATGAC AGGAGATGGA GACAGAGATG ATGAAGTAGA TGATGGGAAT
551 GGCTCTGATA TTCTAATGCC AAAGCTCACC CGTGAGACCA AGGACACCAG 
601 GACGCGCTCT GAAAGCCCGC CTGTCCGAAC CGGACATAgC AATGGGACCT 
651 CCAGCrTGGA GAGGCAAAGA GCCTCCCCCA gAATCACCOG AGGTOGGCAG 
701 GGCCGCCACC ATGTGCAGGA GTACCCTGTG GAGTTTCOGG CTACCAGGTC 
751 TCGGAGACGT CGAGCATCGT CTTCAGCAAG CACGCCATGC TCATCCCCTG 
801 CCAGOGTCGA CTTCATGGAA GAAGTGACAC CTAAGAGCGT CAGTACCCCA 
851 TCAGTTGACT TGAGCCAGGA TGGAGATCAG GAGGGTATGG ATACCACACA 
901 GG1GGATGCA GAGAGCATAT ATGGAgACAC CACAGAGTAT CAgGATGATA 
951 AAGAGniGG AATAGGTGAC CTCGTGTGGG GAAAGATCAA GGGCTTCTCC 
1001 TGGTGGCCTG CCATGGTGGT GTCCTGGAAA GCCACCTCCA AgCGACAGGC 
1051 CATGCCOGGA ATGCGCTGGG TACAGTGGTT TGGTGATGGC AACTTTTCTG
1101 AGATCTCTGC TGACAAACTG GTGGCTCTGG GGCTGTTCAG CCAGCACTTT 
1151 MTCTGGCTA CCTTCAATAA GCTGGTTTCT TATAGGAAGG CCATGTACCA 
1201 CACTCTGGAG AAAGCCAGGG T7CGAGCTGG CAAGACCTTC TCCAGCAGTC 
1251 CTGGAGAGTC ACTGGAGGAC CAGCTGAAGC CCAIGCTGGA GTGGGCCCAC 
1301 GGTGGCTTCA AGCCTACTGG GATGGAGGGC CTCAAACCCA ACAAGAAGCA 
1351 A0CAGTGGT7 AATAAGTCGA AGGTGCGTCG TTCAGACAGT AGGAACTTAG 
1401 AACCCAGGAG ACGOGAGAAC AAAAGTCGAA GACGCACAAC CAATGACTCT
1451 GCTGCTTCTG AGTCCCCCCC ACCCAAGCGC CTCAAGACAA ATAGCTATGG 
1501 CGGGAAGGAC CGAGGGGAGG ATGAGGAGAG CCGAGAACGG ATGGCTTCTG 
1551 AAGTCACCAA CAACAAGGGC AATCTGGAAG ACOGCTCTTT GTCCTGTGGA 
1601 AAGAAGAACC CTGTGTCCTT CCACCCCCTC TTfGAGGGTG GGCTCTGTCA 
1651 GAGTTGCCGG GATCGCTTGC TAGAGCTCT7 CTACATGTAT GATGAGGACG 
1701 GCTATCAGTC CTAC7GCACC GTGIGCTGTG AGGGCCGTGA ACTGCTGC7G 
1751 TGCAGTAACA CAAGCTGCTG CAGATGCTTC TGTGTGGAGT GTCTGGAGCT 
1801 GCTGGTGGGC GCAGGCACAG CTGAGGATGC CAAGCTGCAG GAACCCTGGA 
1851 GCTGCTATAT GTGCCTCCCT CAGCGCTGCC ATGGGGTCCT CCGACGCAGG 
1901 AAAGATTGGA ACATGCGCCf GCAAGACTTC TTCACIACTG ATCCTGACCT 
1951 GGAAGAATTT GAGCCACCCA AGTTGTACCC AGCAATTCCT GCAGCCAAAA 
2001 GGAGGCCCAT TAGAGTCCTG TCTCTGTTTC ATGGAAITGC AACGCGCTAC 
2051 TTGGTGCTCA AGGAGTTGGG TATrAAAGtG CAAAA6TACA TTGCCTCCGA 
2101 AGrCTGTGCA GAGTCCATCG CTGTGGGAAC TGTTAAGCAT GAAGGCCAGA 
2151 TCAAATATGT CAATGACGTC CGGAAAATCA CCAAGAAAAA TATTGAAGAG 
2201 TGGGGCCCGT TCGACTTGGT GATTGGTGGA AGCCCATGCA ATGATCTCTC

FIG. 1B-2
2251 TAACGTCAAT CCTGCCCGCA AAGGTTTATA TGAGGGCACA GGAAGGCTCT
2301 TCTTCGAGTT TTACCACTTG CTGAATTATA CCCGCCCCAA GGAGGGCGAC
2351 AACCCTCCAT TCTTCTGGAT GTTCGAGAAT GTTGTGGCCA TGAAAGTGAA 
2401 TGACAAGAAA GACATCTCAA GATTCCTGGC A7G7AACCCA GTGATGATCG 
2451 ATGCCATCAA GGTGTCTGCT GCTCACAGGG CCCGG7AC7T CTGGGGTAAC 
2501 CTACCCGGAA TGAACAGGCC CGTGATGGCT TCAAAGAAIG ATAAGCTCGA 
2551 GCTGCAGGAC TGCCTGGAGT TCAGTAGGAC AGCAAAG7TA AAGAAAGTGC 
2601 AGACAATAAC CACCAAG7CG AACTCCATCA GACAGG5CAA AAACCAGCTT 
2651 TTCCCTGTAG TCATGAATGG CAAGGACGAC GTTTTGTGGT GCACTGAGCT

2701 CGAAAGGATC TTCGGCTTCC CTCCTCACTA CACGGACGTG TCCAACATGG

2751 GCCGCGGCGC CCGTCAGAAG CTGCTGGGCA GGTCCTGGAG TGTACCGGTC

2801 ATCAGACACC TGTTTGCCCC CTTGAAGGAC TACTTTGCCT GTGAATAGTT

2851 CTACCCAGGA CTGGG5AGCT CTCGGTCAGA GCCAGTGCCC AGAGTCACCC

2901 CTCCCTGAAG GCACCTCACC TGrCCCCTTT 77AGC7CACC TGTGTGGGGC 
2951 CTCACATCAC TGTACCTCAG CTTTCTGCTG CTCAGTGGGA GCAGAGCCTC 
3001 CTGGCCCTTG CAGGGGAGCC CCGGTGCTCC CTCCG7CTGC ACAGCTCAGA 
3051 CCTGGCTGCT TAGAGTAGCC CGGCATGGTG CTCATGTTCT CTTACCCTGA 
3101 AACTTTAAAA CTTGAAGTAG GTAGTAAGAT GGCTTTCTTT TACCCTCCTG
3151 AGTTTATCAC TCAGAAGTGA TGGCTAAGAT ACCAAAAAAA CAAACAAAAA 
3201 CAGAAACAAA AAACAAAAAA AAACC7CAAC AGCTCTcTTA GTACTCAGGT 
3251 TCATGCTGCA AAATCACTIG AGATTTTGn TTTAAGTAAC CCGTGcTCcA 
3301 CA777GC7GG AGGA7GC7A7 7CTGAATG7G GGC7CAGATG AGCAAGG7CA 
3351 AGGGGCCAAA AAAM7rcCC CC7CTCCCCC CAGGAGTA77 TGAAGATGA7 
3401 GTTTATGGTT TAAGTCTTCC TGGCACCTTC CCCTTGCTTT GGTACAAGGG

FIG. 1B-3
3451 CTGAAGTCCT GTTGGTCTTG MGCATTTCC CAGGATGATG ATGTCAGCAG
3501 GGATGACATC ACCACCTTTA GGGCTTTTCC CTGGCAGGGG CCCATGTGGC 
3551 TAGTCCTCAC GAACACTGGA GTAGAATGTT TGGAGCTCAG GAAGGGTGGG 
3601 TGGAGTGGOC CTCTTCCAGG TGTGAGGGAT AGGAAGGAGG AAGCTTAGGG
3651 AAATCCATTC CCCACTCCCT CTTGCCAAAT GAGGGGCCCA GTCCCCAACA 
3701 GCTCAGGTCC CCAGAACCCC CTAGTTCCTC ATGAGAAGCT AGGACCAGAA 
3751 GCACATCGTT CCCCTTATCT GAGCAGTGTT TGGGGAACTA CAGTGAAAAC 
3801 CTTCTGGAGA TGTTAAAAGC TTTTTACCCC AOGATAGATT GTGTTTTIAA
3851 GGGGTGCTTT TTTTAGGGCC ATCACTGGAG ATAAGAAAGC TGCATTTCAG 
3901 AAATGCCATC GTAATGGTTT TTAAACACCT TTTACCTAAT TACAGGTGCT 
3951 ATTTTATAGA AGCAGACAAC ACTTCTTTTT ATGACTCTCA GACTTCTATT 
4001 TTCATGTTAC CATTTTTTTT GTAACTOGCA AGGTGTGGGC HTTGTAACT 
4051 TCACAGGTGT GGGGAGAGAC TGCCTTGTTT CAACAGTTTG TCTCCACTCG 
4101 TTTCTAATTT TTAGGTGCAA AGAIGACAGA TGCCCAGAGT TTACCTT7CT 
4151 GGTTGATTAA AGTTGTATTT CTCTAAAAAA AAAAAAAAAA AAAAA
Human DNMT3A DNA Sequence
1 CGCCGGCGTC GACCGACAGC GAGOGGAGGG AGGGAGCGAG CGAGCCAGCA 
51 GCAGCGGCCG GGAGGGAGGG AGGGCGCGCG GGCGGOGGCG GCGGCGAGAG 
101 CAGAGGACGA GCCGGGACGC GGCGCCGCGG CACCAGGGCG CGCAGCCGGG 
151 CCGGCCCGAC CCCACCGGCC ATAGGGTGGA GCCATCGAAG CCCCCAGCCA 
201 CAGGCTGACA GAGGCACCGT TCACCAGAGG CCTCAACACC GGGATCTATG 
251 TTTAAGTTTT AAC7CTCGCC TCCAAAGACC ACGATAATTC CTTCCCCAM 
301 GCCCAGCAGC CCCCCAGCCC CGCGCAGCCC CAGCCTGCCT CCCGGCGCCC 
351 AGATGCCCGC CATGCCCTCC AGCGGCCCCG GGGACACCAG CAGCTCTGCT 
401 GCGGAGCGGG AGGAGSACCG AAAGGACGGA GAGGAGCAGG AGGAGCCGCG 
451 TGGCAAGGAG GAGCGCCAAG AGCCCAGCAC CACGGCACGG AAGGTGGGGC 
501 GGCCTGGGAG GAAGCGCAAG CACCCCCCGG TGGAAAGCGG TGACACGCCA 
551 AAGGACCCTG CGGTGATCTC CAAGTCCCCA TCCATGGCCC AGGACTCAGG 
601 CGCCTCAGAG CTATTACCCA ATGGGGACTT GGAGAAGCGG AGTGAGCCCC 
651 AGCCAGAGGA GGGGAGCCCT GCTGGGGGGC AGAAGGGCGG GGCCCCAGCA 
701 GAGGGAGAGG GTGCAGCTGA GACCCTGCCT GAAGCCTCAA GAGCAGTGGA 
751 AAATGGCTGC TGCACCCCCA AGGAGGGCCG AGGAGCCCCT GCAGAAGCGG 
801 GCAAAGAACA GAAGGAGACC AACATCGAAT CCATGAAAAT GGAGGGCTCC 
851 CGGGGCCGGC TGCGGGGTGG CTTGGGCTGG GAGTCCAGCC TCCGTCAGCC
901 GCCCATGCCG AGGCTCACCT TCCAGGCGGG GGACCCCTAC TACATCAGCA 
951 AGCGCAAGCG GGACGAGTGG CTGGCACGCT GGAAAAGCGA GGCTGAGAAG 
1001 AAAGCCAAGG TCATTGCAGG AATGAATGCT GTGGAAGAAA ACCAGGGGCC 

1051 CGGGGAGTCT CACAAGGTGG AGGAGGCCAG CCCTCCTGCT GTGCAGCAGC

1101 CCACTGACCC CGCATCCCCC ACTGTGGCTA CCACGCCTGA GCCCGTGGGG 
1151 TCCGATGCTG GGGACAAGAA TGCCACCAAA GCAGGCGATG ACGAGCCAGA

FIG. 1C-1
1201 GTACGAGGAC GGCCGGGGCT TTGGCATTGG GGAGCTGGTG TGGGGGAAAC
1251 TGCGGGGCTT CTCCTGGTGG CCAGGCCGCA TTGTGTCTTG GTGGATGACG 
1301 GGCCGGAGCC GAGCAGCTGA AGGCACCCGC TGGCTCATGr GGTTCGGAGA 
1351 CGGCAAATTC TCAGTGGTGT GTGTTGAGAA GCTGATGCCG CTGAGCTCGT
1401 TTTGCAGTGC GTTCCACCAG GCCACGTACA ACAAGCAGCC CATGTACCGC
1451 AAAGCCATCT ACGAGGTCCT GCAGGTGGCC AGCAGCCGCG CGGGGAAGCT 
1501 GTTCCOGGTG TGCCACGACA GCGATGAGAG 7GACACTGCC AAGGCCGTGG 
1551 AGGTGCAGAA CAAGCCCATG ATTGAATGGG CCCTGGGGGG CTTCcAGCAT 
1601 TATGGCCCTA AGGGCCTGGA GCCACCAGAA GAAGAGAAGA ATCCCTACAA 
1651 AGAAGTGTAC ACGGACATGT GGGTGGAACC TGAGGCAGCT GCATACGCAC 
1701 CACCTCCACC AGCCAAAAAG CCCCGGAAGA GCACAGCGGA GAAGCCCAAG
1751 GTCAAGGAGA TTATTGATGA GGGCACAAGA GAGCGGcTGG TGTACGAGGT 
1801 GCGGCAGAAG TGCCGGAACA T7GAGGACAT CTGCATCTCC TGTGGGAGCC 
1851 TCAATGTTAC CCTGGAACAC CCCCTCTTCG TTGGAGGAAT GTGCCAAAAC 
1901 TGCAAGAACT GCTTTCTGGA GTGTGCGTAC CAGTACGACG ACGACGGCTA
1951 CCAGTCCTAC TGCACCATCT GCTGTGGGGG CCGTGAGCTG CTCATGTGCG 
2001 GAAACAACAA CTGCTGCAGG TGCTTTTGCG TGGAGTGTGT GGACCTCTTG
2051 GTGGGGCCGG GGGCTGCCCA gGCAGCCATT AAGGAAgACC CCTGGAACTG 
2101 CTACATGTGC GGGCACAAGG GTACCTACGG GCTGCTGCGG CGGCGAAAGG 
2151 ACTGGCCCTC CCGGCTCCAg ATGTTCTTCG CTAATAACCA CgACCAGgAA 
2201 TTTGACCCTC CAAAGGTTTA CCCACCTGTC CCAGCTgAgA AAAGGAAGCC
2251 CATCCGGCTG C7GTCTCTCT TrGATGGAAT CGCTACAGGG CTCCTGGTGC 
2301 TGAAGGACTT GGGCATTCAG GTGGACCGCT ACATTGCCTC GGAGGTGTGT

FIG. 1C-2
2351 CAGGACTCCA TCACGGTGGG CATGGTGCGG CACCAGGGGA AGATCATGTA
2401 CGTCGGGGAC GTCCGCAGCG TCACACAGAA GCA7ATCCAG GAGIGGGGCC 
2451 CATTCGAICT GGTGATTGGG GGCAG7CCC7 GCAATGACCT CTCCATOGTC 
2501 AACCCTGCTC GCAAGGGCCT C7ACGAGGGC ACTGGCCGGC TCTTCTTTGA 
2551 GTTCTACCGC CTCC7GCATG ATGGGCGGCC CAAGGAGGGA GATGATCGCC 
2601 CCTTCTTCTG GCTCTTTGAG AATGTGGTGG CCATGGGCGT TAGTGACAAG 
2651 AGGGACATCT CGCGATTTCT CGAGTCCAAC CCTGTGATGA TTGATGCCAA 
2701 AGAAGTGTCA GCTGCACACA GGGCCCGCTA CTTCTGGGGT AACCTTCCCG 
2751 GTATGAACAG GCCGTTGGCA TCCACTGTGA ATGATAAGCT GGAGCTGCAG 
2801 GAGTGTCTGG AGCATGGCAG GATAGCCAAG TTCAGCAAAG TGAGGACCAT 
2851 TACTACGAGG TCAAACTCCA TAAAGCAGGG CAAAGACCAG CATTTTCCTG
2901 TCTTCATGAA TGAGAAAGAG GACATCTTAT GGTGCACTGA AATGGAAAGG 
2951 GTATTTGGTT TCCCAGfCCA CTATACTGAC GTCTCCAACA TGAGCCGCTT 
3001 GGCGAGGCAG AGACTGCTGG GCCGGTCATG GAGCGTGCCA GTCATCCGCC 
3051 ACCTCTTCGC TCCGCTGAAG GAGTATTTTG CGTGTGIGTA AGGGACATGC 
3101 GGGCAAACTG AGGTAGCGAC ACAAAGTTAA ACAAACAMC AAAAAACACA 
3151 AAACATAATA AAACACCAAG AACATGAGGA TGGAGAGAAG TATCAGCACC 
3201 CAGAAGAGAA AAAGGAATTT AAAACAAAAA CCACAGAGGC GGAAATACCG 
3251 CAGGGCTTTC CCTTGCGAAA AGGGTTGGAC ATCATCTCCT GATTTTTCAA 
3301 TGTTATTCTT CAGTCCTATT TAAAAACAAA ACCAAGCTCC CTTCCCTTCC
3351 TCCCCCTICC CTTTTnTTC GGTCAGACCT T7TA7TTTCT ACTCTTTTCA 
3401 GAGGGGTTTT CTGTTTGTTT GGGTTI7GTT TCTTGCTGTG ACTGAAACAA 
3451 GAAGGT7ATT GCAGCAAAAA 7CAG7AACM AAAA7AG7AA CAA7ACC7TG 
3501 CAGAGGAAAG GTGGGAGGAG AGDAAAAAAG GGAAATTTTT AAAGAAATCT

FIG. 1C-3
3551 ATATATTGGG nCTTTTTTT TTTTGTTTTI TGTTTTTTFT TTTTGGGTTT
3601 TTTTTrTTTA CTAWATCT TnrntGTT CTCTCTACCC TGATCAGArA 
3651 GGAGCACAAG CAGGGGACGG AAAGAGAGAG ACACTCAGGC GGCAGCATTC
3701 CCTCCCAGCC ACTGAGCTGT CGTGCCAGCA CCATTCCTGC TCACGCAAAA 
3751 CAGAACCCAG TTAGCAGCAG GGA6ACGAGA ACACCACACA AGACATTTTT 
3801 CTACAGTATT TCAGGTGCCT ACCACACAGG AAACCTTGAA GAAAATCAGT
3851 TTCTAGAAGC CGCTCTTACC TCTTGTTTAC AGTTTATATA TATATGATAG 
3901 ATATGAGATA TATATATAAA AGGTACTGTT AACTACTGTA CAACCCGACr 
3951 TCATAATGGT GCTTTCAAAC AGCGAGATGA GTAAAAACAT CAGCTTCCAC 
4001 GTTGCCTTCT GCGCAAAGGG TTTCACCAAG GATGGAGAAA GGGAGACAGC 
4051 TTGCAGATGG CGCGTTCTCA CGGTGGGCTC ITCCCCTTGG TT7GTAACGA 
4101 AGTGAA5GAG GAGAACTTGG GAGCCAGGTT CTCCCTGCCA AAAAGGGGGC 
4151 TAGATGAGGT GGTCGGGCCC GTGGACAGCT GAGAGTGGGA TTCATCCAGA 
4201 CTCATGCAAT AACCCTTTGA TTCTTTTCTA AAAGGAGACT CCCTCGGCAA 
4251 GATGGCAGAG GGTAOGGAGT CTTCAGGCCC AGTTTCTCAC TTTAGCCAAT 
4301 TCGAGCGCTC CTTGTGGTGG GATCAGAACT AATCCAGAGT GTGGGAAAGT
4351 GACAGTCAAA ACCCCACCTG GAGCMATAA AAAAACATAC AAAACGTAAA 
4401 AAAAAAAAAA AAAAAA 
Human DNMT3B1 DNA Sequence:
1 GGCCGCGAAT TCGGCACGAG CCCTGCADGG CCGCCAGCCG GCCTCCCGCC 
51 AGCCAGCCCC GACCCGCGGC TCOGCCGCCC AGCCGCGCCC CAGCCAGCCC 
101 TGCGGCAGGA AAGCATGAAG GGAGACACCA GGCATCTCAA TGGAGAGGAG
151 GACGCCGGCG GGAGGGAAGA CTCGATCCTC CTCAACGGGG CCTGCAGCGA 
201 CCACTCCTCC GACTCGCCCC CAATCCTGGA GGCTATCCGC ACCCCGGAGA 
251 TCAGAGGCCG AAGATCAAGC TCGCGACTCT CCAAGAGGGA GGTGTCCAGT
301 CTGCTAAGCT ACACACAGGA CTTGACAGGC GATGGCGACG GGGAAGATGG 
351 GGATGGCTCT GACACCCCAG TCATGCCAAA GCTCTTCCGG GAAACCAGGA 
401 CTCGTTCAGA AAGCCCAGCT GTCGGAACTC GAAATMCAA CAGTGTCTCC 
451 AGCCGGGAGA GGCACAGGCC TTCCCCACGT TCCACCCGAG GCCGGCAGGG
501 COGCAACCAT GTGGACGAGT CCCCCGTGGA GTTCCCGGC7 ACCAGGTCCC 
551 TGAGACGGCG GGCAACAGCA TCGGCAGGAA CGCCATGGCC GTCCCCTCCC

601 AGCTCTTACC TTACCATCGA CCTCACAGAC GACACAGAGG ACACACATGG

651 GACGCCCCAG AGCAGCAGTA CCCCCTACGC CCGCCTAGCC CAGGACAGCC 
701 AGCAGGGGGG CATGGAGTCC CCGCAGGTGG AGGCAGACAG TGGAGATGGA 
751 GACAGTTCAG AGTATCAGGA TGGGAAGGAG TTTGGAATAG GGGACCTCGT 
801 GTGGGGAAAG ATCAAGGGCT TCTCCTGGTG GCCCGCCATG GTGGTGTCTT 
851 GGAAGGCCAC C7CCAAGCGA CAGGCTATGT CTGGCATGCC GTGGGTCCAG 
901 TGGTTTGGCG ATGGCAAGn CTCCGAGGTC TCTGCAGACA AACTGGTGGC 
951 ACTGGGGCTG TTCAGCCAGC ACTTTAATTT GGCCACCTTC AATAAGCTCG 
1001 TCTCCTATCG AAAAGCCATG TACCA7GCTC TGGAGAAAGC TAGGGTGCGA 
1051 GCTGGCAAGA CCTTCCCCAG CAGCCCTGGA GACTCATTGG AGGACCAGCT
1101 GAAGCCCATG TTGGAGTGGG CCCACGGGGG CTTCAAGCCC ACTGGGATCG
1151 AGGGCCTCAA ACCCAACAAC ACGCAACCAG TGGTTAATAA GTCGAAGCTG

FIG. 1D-1
1201 CGTCGTGCAG GCAGTAGGAA ATTAGAATCA AGGAAATACG AGAACAAGAC

1251 TCGAAGACGC ACAGCrGACG ACTCAGCCAC CTCTGACTAC TGCCCCGCAC 

1301 CCAAGOGOCT CAAGACAAAT TGCTATAACA AGGGCAAAGA CCGAGGGGAT 

1351 GAAGATCAGA GCOGAGAACA MTGGCITCA GATGTTGCCA ACAACAAGAG 

1401 CAGCCTGGAA GATGGCTGTT TGTCTTGTGG CAGGAAAAAC CCCGTGTCCT

1451 TCCACCCTCT CmGAGGGG GGGCTCTGTC AGACATGCCG GGATCGCTTC

1501 CTTGAGCTGT TTTACATGTA TGATGAOGAT GGCTATCAGT CTTACTGCAC 

1551 TGTGTGCTGC GAGGGCCGAG AGCTGCTGCT TTGCAGCAAC ACGAGCTGCT 

1601 GCCGGTG7TT CTGTGTGGAG TGCCTGGAGG TGCTGGTGGG CACAGGCACA 

1651 GDGGCCGAGG CCAAGCTTCA GGAGCCCTGG AGCTCCTACA TGTGTCTCCC 

1701 GCAGCGCTGT CATGGOGTCC TGCGGOGCCG GAAGGACTGG AACGTGCGCC 

1751 TGCAGGCCTT CTTCACCAGT GACACGGGGC TTGAATACGA AGCCCCCAAG 

1801 CTGTACCCTG CCATTCCCGC AGCCCGAAGG CGGCCCATTC GAGTCCTGTC 

1851 ATTCI7TGAT GGCATCGCGA CAGGC7ACCT AGTCCTCAAA GAGTTGGGCA 

1901 TAAAGGTAGG AAAGTACGTC GCTTCTGAAG TGTGTGAGGA GTCCATTGCT 

1951 GTTGGAACCG TGAAGCACGA GGGGAATATC AAATACGTGA ACGACGTGAG 

2001 GAACATCACA AAGAAAAATA TTGAAGAATG GGGCCCATTT GACTTGGTGA 

2051 TTGGCGGAAG CCCATGCAAC GATCTCTCAA ATGTGAATCC AGCCAGGAAA 

2101 GGCCTGIATG AGGGTACAGG CCGGCTCTTC TTGGAATTTT ACCACCTGCT 

2151 GAATTACTCA CGCCCCAAGG AGGGTGATGA CCGGCCGTTC TTCTGGATGT 

2201 TTGAGAATGT TGTAGCCATG AAGGTTGGCG ACAAGAGGGA CATCTCACGG 

2251 TTCCTGCAGT GTAATCCAGT GATGATTGAT GCCATCAAAG TTTCTGCTGC 

2301 TCACAGGGCC CGATACTTCT GGGGCAACCT ACCCGGGATG AACAGGCCCG 

2351 TGATAGCATC AAAGAATGAT AAACTCGAGC TGCAGGACTG CTTGGAATAC 

2401 AATAGGATAG CCAAGHAAA GAAAGTACAG ACAATAACCA CCAAGTCGAA

FIG. 1D-2
2451 CTCGATCAAA CAGGGGAAAA ACCAACTTTT CCCTGTTGTC ATGAATGGCA

2501 AAGAAGATGT TTTGTGGTGC ACTGAGCTCG AAAGGATCTT TGGCTTTCCT 

2551 GTGCACTACA CAGACGTGTC CAACATGGGC CGTGGTGCCC GCCAGAAGCi 

2601 GCTGGGAAGG TCCTGGAGCG TGCCTGTCAT CCGACACCTC TTOGCCCCTC 

2651 TCAAGGACTA CTTTGCATGT GMTAGTTCC AGCCAGGCCC CAAGCCCACT 

2701 GGGGTGTGTG GCAGAGCCAG GACCCAGGAG GTGTGATTCC TGAAGGCATC 

2751 CCCAGGCCCT GCTCTTCCTC AGCTGTGTGG GTCArACCGT GTACCTCAGT 

2801 TCCCTCTTGC TCAGTGGGGG CAGAGCCACC TGACTGTTGC AGGGGTAGCC 

2851 TGAGGTGCCG CCTCETTGTG CACAAATCAG AGCTGCCTGC TTGGAGCAGC 

2901 CTAACACGGT GCTCATTTTT TCTTCTCCTA AAACTTTAAA ACTTCAAGTA 

2951 GGTA5CAACG TGGCTTTTTT TTTTTCCCTT CCTGGGTCTA CCACTCAGAG 

3001 AAACAATGGC TAAGATACCA AAACCACAGT GCCGACAGCT CTCCAATACT

3051 CAGGTTAATG CTGAAAAATC ATCCAAGACA GTTATTGCAA GAGTTTAATT 

3101 TTTGAAAACT GGGTACTGCT A1GTGTTTAC AGAOGTGTGC AGTTGTAGGC 

3151 ATGTAGCTAC AGGACATTTT TAAGGGCCCA GGATCGTTTT TTCCCAGGGC 

3201 AAGCAGAAGA GAAAATGTTG TATATGTCTT TTACCCGGCA CATTCCCCTT 

3251 GCCTAAATAC AAGGGCTGGA GTCTGCACGG GACCTATTAG AGTATTTTCC 

3301 ACAATGATGA TGATTTCAGC AGGGATGACC TCATCATCAC ATTCAGGGCT 

3351 ATTTTTTCCC CCACAAACCC AAGGGCAGGG GCCACTCTTA GCTAMTCCC 

3401 TCCCCGTGAC TGCAATAGAA CCCTCTGGGG AGCTCAGGAA GGGGTG7GCT 

3451 GAGTTCTATA ATATAAGCTG CCATATATT7 TGTACACAAG TATGGCTCCT 

3501 CCATATCTCC CTCnCCCTA GGAGAGGAGT GTGAAGCAAC GAGCTTAGAT 

3551 AAGACACCCC CTCAAACCCA TTCCCTCTCC AGGAGACCTA CCCTCCACAG 

3601 GCACAGGTCC CCAGATGAGA AGTCTGCTAC CCTCATTTCT CArCTTTTTA

3651 CTAAACTCAG AGGCAGTGAC AXAGTCAGG GACAGACATA CAHTCTCAT

FIG. 1D-3
3701 ACCTTCCCCA CATCTGAGAG ATCACAGGGA AAACTGCAAA GCICGGTGCT

3751 CCCTTTGGAG ATTTTTTAAT CCTTTTTTAT TCCATAAGAA GTCGITTTTA 

3801 GGGAGAACGG GAATTCAGAC AAGCTGCATT TCAGAAATGC TCTCATAATG 

3851 GTTTTTAACA CCTTTTACTC TTCTTACTGG TGCTATTTTG TAGAATAAGG 

3901 AACAACGTTG ACAAGTTTTG TGGGGCTTTT TATACACTTT TTAAMTCTC 

3951 AAACTTCTAT TTTTATGTCT AAOGTTTTCA TTAAAATITT TTTGTAACTG 

4001 GAGCCACGAC GTAACAAATA TGGGGAAAAA ACTGTGCCTT GTTTCAACAG

4051 TTTTCGCTAA TTTTTAGGCT GAAAGATGAC GGATGCCTAG AGTTTACCTT 

4101 ATGTTTAATT AAAATCAGTA TTTGTCTAAA AAAAAAAAAA AAAAA 

FIG.1D-4 
Mouse Dnmt3a Protein
1 MPSSGPGDTS SSSLEREDDR KEGEEQEENR GKEERQEPSA TARKVGRPGR
51 KRKHPPVESS DTPKDPAVTT KSQPMAQDSG PSDLLPNGDL EKRSEPQPEE
101 GSPAAGQKGG APAEGEGTET PPEASRAVEN GCCVTKEGRG ASAGEGKEQK
151 QTNIESMKME GSRGRLRGGL GWESSLRQRP MPRLTFQAGD PYYISKRKRD
201 EWLARWKRDA EKKAKVIAVM NAVEENQASG ESQKVEEASP PAVQQPTDPA
251 SPTVATTPEP VGGDAGDKNA TKAPDDEPEY EDGRGFGIGE LVWGKLRGFS
301 WWPGRIVSWW MTGRSRAAEG TRWVMWFGDG KFSVVCVEKL MPLSSFCSAF
351 HQATYNKQPM YRKAIYEVLQ VASSRAGKLF PACHDSDESD SGKAVEVQNK
401 QMIEWALGGF QPSGPKGLEP PEEEKNPYKE VYTDMWVEPE AAAYAPPPPA
451 KKPRKSTTEK PKVKEIIDER TRERLVYEVR QKCRNIEDIC ISCGSLNVTL
501 EHPFFIGGMC QNCKNCFLEC AYQYDDDGYQ SYCTICCGGR EVLMCGNNNC
551 CRCFCVECVD LLVGPGAAQA AIKEDPWNCY MCGHKGTYGL LRRREDWPSR
601 LQMFFANNHD QEFDPPKVYP PVPAEKRKPI RVLSLFDGIA TGLLVLKDLG
651 IQVDRYIASE VCEDSITVGM VRHQGKIMYV GDVRSVTQKH IQEWGPFDLV
701 IGGSPCNDLS IVNPARKGLY EGTGRLFFEF YRLLHDARPK EGDDRPFFWL
751 FENVVAMGVS DKRDISRFLE SNPVMIDAKE VSAAHRARYF WGNLPGMNRP
801 LASTVNDKLE LQECLEHGRI AKFSKVRTIT TRSNSIKQGK DQHFPVFMNE
851 KEDILWCTEM ERVFGFPVHY TDVSNMSRLA RQRLLGRSWS VPVIRHLFAP
901 LKEYFACV*

FIG. 2A 
Mouse Dnmt3b1 Protein
1 MKGDSRHLNE EEGASGYEEC IIVNGNFSDQ SSDTKDAPSP PVLEAICTEP
51 VCTPETRGRR SSSRLSKREV SSLLNYTQDM TGDGDRDDEV DDGNGSDILM
101 PKLTRETKDT RTRSESPAVR TRHSNGTSSL ERQRASPRIT RGRQGRHHVQ
151 EYPVEFPATR SRRRRASSSA STPWSSPASV DFMEEVTPKS VSTPSVDLSQ
201 DGDQEGMDTT QVDAESIYGD STEYQDDKEF GIGDLVWGKI KGFSWWPAMV
251 VSWKATSKRQ AMPGMRWVQW FGDGKFSEIS ADKLVALGLF SQHFNLATFN
301 KLVSYRKAMY HTLEKARVRA GKTFSSSPGE SLEDQLKPML EWAHGGFKPT
351 GIEGLKPNKK QPVVNKSKVR RSDSRNLEPR RRENKSRRRT TNDSAASESP
401 PPKRLKTNSY GGKDRGEDEE SRERMASEVT NNKGNLEDRC LSCGKKNPVS
451 FHPLFEGGLC QSCRDRFLEL FYMYDEDGYQ SYCTVCCEGR ELLLCSNTSC
501 CRCFCVECLE VLVGAGTAED AKLQEPWSCY MCLPQRCHGV LRRRKDWNMR
551 LQDFFTTDPD LEEFEPPKLY PAIPAAKRRP IRVLSLFDGI ATGYLVLKEL
601 GIKVEKYIAS EVCAESIAVG TVKHEGQIKY VNDVRKITKK NIEEWGPFDL
651 VIGGSPCNDL SNVNPARKGL YEGTGRLFFE FYHLLNYTRP KEGDNRPFFW
701 MFENVVAMKV NDKKDISRFL ACNPVMIDAI KVSAAHRARY FWGNLPGMNR
751 PVMASKNDKL ELQDCLEFSR TAKLKKVQTI TTKSNSIRQG KNQLFPVVMN
801 GKDDVLWCTE LERIFGFPAH YTDVSNMGRG ARQKLLGRSW SVPVIRHLFA
851 PLKDYFACE*

FIG.2B 
Human DNMT3A Protein

1 MPAMPSSGPG DTSSSAAERE EDRKDGEEQE EPRGKEERQE PSTTARKVGR

51 PGRKRKHPPV ESGDTPKDPA VISKSPSMAQ DSGASELLPN GDLEKRSEPQ

101 PEEGSPAGGQ KGGAPAEGEG AAETLPEASR AVENGCCTPK EGRGAPAEAG

151 KEQKETNIES MKMEGSRGRL RGGLGWESSL RQRPMPRLTF QAGDPYYISK

201 RKRDEWLARW KREAEKKAKV IAGMNAVEEN QGPGESHKVE EASPPAVQQP

251 TDPASPTVAT TPEPVGSDAG DKNATKAGDD EPEYEDGRGF GIGELVWGKL

301 RGFSWWPGRI VSWWMTGRSR AAEGTRWVMW FGDGKFSVVC VEKLMPLSSF

351 CSAFHQATYN KQPMYRKAIY EVLQVASSRA GKLFPVCHDS DESDTAKAVE

401 VQNKPMIEWA LGGFQHYGPK GLEPPEEEKN PYKEVYTDMW VEPEAAAYAP

451 PPPAKKPRKS TAEKPKVKEI IDERTRERLV YEVRQKCRNI EDICISCGSL

501 NVTLEHPLFV GGMCQNCKNC FLECAYQYDD DGYQSYCTIC CGGREVLMCG

551 NNNCCRCFCV ECVDLLVGPG AAQAAIKEDP WNCYMCGHKG TYGLLRRRKD

601 WPSRLQMFFA NNHDQEFDPP KVYPPVPAEK RKPIRVLSLF DGIATGLLVL

651 KDLGIQVDRY IASEVCEDSI TVGMVRHQGK IMYVGDVRSV TQKHIQEWGP

701 FDLVIGGSPC NDLSIVNPAR KGLYEGTGRL FFEFYRLLHD ARPKEGDDRP

751 FFWLFENVVA MGVSDKRDIS RFLESNPVMI DAKEVSAAHR ARYFWGNLPG

801 MNRPLASTVN DKLELQECLE HGRIAKFSKV RTITTRSNSI KQGKDQHFPV

851 FMNEKEDILW CTEMERVFGF PVHYTDVSNM SRLARQRLLG RSWSVPVIRH

901 LFAPLKEYFA CV*

FIG.2C 
Human DNMT3B1 Protein

1 MKGDTRHLNG EEDAGGREDS ILVNGACSDQ SSDSPPILEA IRTPEIRGRR

51 SSSRLSKREV SSLLSYTQDL TGDGDGEDGD GSDTPVMPKL FRETRTRSES

101 PAVRTRNNNS VSSRERHRPS PRSTRGRQGR NHVDESPVEF PATRSLRRRA

151 TASAGTPWPS PPSSYLTIDL TDDTEDTHGT PQSSSTPYAR LAQDSQQGGM

201 ESPQVEADSG DGDSSEYQDG KEFGIGDLVW GKIKGFSWWP AMVVSWKATS

251 KRQAMSGMRW VQWFGDGKFS EVSADKLVAL GLFSQHFNLA TFNKLVSYRK

301 AMYHALEKAR VRAGKTFPSS PGDSLEDQLK PMLEWAHGGF KPTGIEGLKP

351 NNTQPVVNKS KVRRAGSRKL ESRKYENKTR RRTADDSATS DYCPAPKRLK

401 TNCYNNGKDR GDEDQSREQM ASDVANNKSS LEDGCLSCGR KNPVSFHPLF

451 EGGLCQTCRD RFLELFYMYD DDGYQSYCTV CCEGRELLLC SNTSCCRCFC

501 VECLEVLVGT GTAAEAKLQE PWSCYMCLPQ RCHGVLRRRK DWNVRLQAFF

551 TSDTGLEYEA PKLYPAIPAA RRRPIRVLSL FDGIATGYLV LKELGIKVGK

601 YVASEVCEES IAVGTVKHEG NIKYVNDVRN ITKKNIEEWG PFDLVIGGSP

651 CNDLSNVNPA RKGLYEGTGR LFFEFYHLLN YSRPKEGDDR PFFWMFENVV

701 AMKVGDKRDI SRFLECNPVM IDAIKVSAAH RARYFWGNLP GMNRPVIASK

751 NDKLELQDCL EYNRIAKLKK VQTITTKSNS IKQGKNQLFP VVMNGKEDVL

801 WCTELERIFG FPVHYTDVSN MGRGARQKLL GRSWSVPVIR HLFAPLKDYF

851 ACE*



FIG. 2D 
Dnmt3a   1 MPSSGPGDTSSSSLEREDDRKEGEEQEENRGKEERQEPSATARKVGRPGR  50

Dnmt3a  51 KRKHPPVESSDTPKDPAVTTKSQPMAQDSGPSD....LLPNGDLEKRSEP  96
Dnmt3b   1                  MKGDSRHLNEEEGASGYEECIIVNGNFSDQSSD  33

Dnmt3a  97 QPEEGSP....AAGQKGGAPAEGEGTETPPEAS.RAVENGCCVTKE..GR 139
Dnmt3b  34 TKDAPSPPVLEAICTEPVCTPETRGRRSSSRLSKREVSSLLNYTQDMTGD  83

Dnmt3a 140 G ASAGEG KEQKQTNI ESMKMEGSRGRLRGGLGWESSLRQ 178
Dnmt3b  84 GDRDDEVDDGNGSDILMPKLTRETKDTRTRSESPAVRTRHSNGTSSLERQ 133

Dnmt3a 179 RPMPRLTFQAGDPYYISKRKRDEWLARWKRDAEKKAKVIAVMNAVEENQA 228
Dnmt3b 134 RASPRITRGRQGRHHV.....QEYPVEFPATRSRRRRASSSASTPWSSPA 178

Dnmt3a 229 SGESQKVEEASPPAVQQPTDPASPTVATTPEPVGGDAGDKNATKAPDDEP 278
Dnmt3b 179 SVDF..MEEVTPKSVSTP....SVDLSQDGDQEGMDTTQVDAESIYGDST 222

Dnmt3a 279 EYEDGRGFGIGELVWGKLRGFSWWPGRIVSWWMTGRSRAAEGTRWVMWFG 328
Dnmt3b 223 EYQDDKEFGIGDLVWGKIKGFSWWPAMVVSWKATSKRQAMPGMRWVQWFG 272

Dnmt3a 329 DGKFSVVCVEKLMPLSSFCSAFHQATYNKQPMYRKAIYEVLQVASSRAGK 378
Dnmt3b 273 DGKFSEISADKLVALGLFSQHFNLATFNKLVSYRKAMYHTLEKARVRAGK 322

Dnmt3a 379 LFPACHDSDESDSGKAVEVQNKQMIEWALGGFQPSGPKGLEPPEEEK..N 426
Dnmt3b 323 TF.......SSSPGESLEDQLKPMLEWAHGGFKPTGIEGLKPNKKQPVVN 365

Dnmt3a 427 PYKEVYTDMW. VEP EAAAYAPPPPAKKPRKSTTEKPK 462
Dnmt3b 366 KSKVRRSDSRNLEPRRRENKSRRRTTNDSAASESPPPKRLKTNSYGGKDR 415

FIG.3A-1 
Dnmt3a 463 VKEIIDERTRERLVYEVRQKCRNIEDICISCGSLNVTLEHPFFIGGMCQN 512
Dnmt3b 416 GE...DEESRERMASEVTNNKGNLEDRCLSCGKKNPVSFHPLFEGGLCQS 462

Dnmt3a 513 CKNCFLECAYQYDDDGYQSYCTICCGGREVLMCGNNNCCRCFCVECVDLL 562
Dnmt3b 463 CRDRFLELFYMYDEDGYQSYCTVCCEGRELLLCSNTSCCRCFCVECLEVL 512

Dnmt3a 563 VGPGAAQAAIKEDPWNCYMCGHKGTYGLLRRREDWPSRLQMFFANNHD.Q 611
Dnmt3b 513 VGAGTAEDAKLQEPWSCYMCLPQRCHGVLRRRKDWNMRLQDFFTTDPDLE 562

Dnmt3a 612 EFDPPKVYPPVPAEKRKPIRVLSLFDGIATGLLVLKDLGIQVDRYIASEV 661
Dnmt3b 563 EFEPPKLYPAIPAAKRRPIRVLSLFDGIATGYLVLKELGIKVEKYIASEV 612

Dnmt3a 662 CEDSITVGMVRHQGKIMYVGDVRSVTQKHIQEWGPFDLVIGGSPCNDLSI 711
Dnmt3b 613 CAESIAVGTVKHEGQIKYVNDVRKITKKNIEEWGPFDLVIGGSPCNDLSN 662

Dnmt3a 712 VNPARKGLYEGTGRLFFEFYRLLHDARPKEGDDRPFFWLFENVVAMGVSD 761
Dnmt3b 663 VNPARKGLYEGTGRLFFEFYHLLNYTRPKEGDNRPFFWMFENVVAMKVND 712

Dnmt3a 762 KRDISRFLESNPVMIDAKEVSAAHRARYFWGNLPGMNRPLASTVNDKLEL 811
Dnmt3b 713 KKDISRFLACNPVMIDAIKVSAAHRARYFWGNLPGMNRPVMASKNDKLEL 762

Dnmt3a 812 QECLEHGRIAKFSKVRTITTRSNSIKQGKDQHFPVFMNEKEDILWCTEME 861
Dnmt3b 763 QDCLEFSRTAKLKKVQTITTKSNSIRQGKNQLFPVVMNGKDDVLWCTELE 812

Dnmt3a 862 RVFGFPVHYTDVSNMSRLARQRLLGRSWSVPVIRHLFAPLKEYFACV* 909
Dnmt3b 813 RIFGFPAHYTDVSNMGRGARQKLLGRSWSVPVIRHLFAPLKDYFACE* 860
DNMT3A   1 MPAMPSSGPGDTSSSAAEREEDRKDGEEQEEPRGKEERQEPSTTARKVGR

DNMT3A  51 PGRKRKHPPVESGDTPKDPAVISKSPSMAQDSGASELLPNGDLEKRSEPQ
DNMT3B   1                MKGDTRHLNGEEDAGGREDSILVNGACSDQSSDSP

DNMT3A 101 PEEGSPAGGQKGGAPAEGEGAAETLPEASRAVENGCCTPKEGRGAPAEAG
DNMT3B  36 PILEAIRTPEIRGRRSSSRLSKREVSSLLSYTQDLTGDGDGEDGDGSDTP

DNMT3A 151 KEQKETNIESMKMEGSRGRLRGGLGWESSLRQRPMPRLTFQAGDPYYISK
DNMT3B  86 VMPKLFRETRTRSESPAVRTRNNNSVSSRERHRPSPRSTRGRQGRNHVDE

DNMT3A 201 RKRDEWLARWKREAEKKAKVIAGMNAVEENQGPGESHKVEEASPPAVQQP
DNMT3B 136 SPVEFPATRSLRRRATASAGTPWPSPPSSYLTIDLTDDTEDTH..GTPQS

DNMT3A 251 TDPASPTVATTPEPVGSDAGDKNATKAGDDEPEYEDGRGFGIGELVWGKL
DNMT3B 184 SSTPYARLAQDSQQGGMESPQVEADSGDGDSSEYQDGKEFGIGDLVWGKI

DNMT3A 301 RGFSWWPGRIVSWWMTGRSRAAEGTRWVMWFGDGKFSVVCVEKLMPLSSF
DNMT3B 234 KGFSWWPAMVVSWKATSKRQAMSGMRWVQWFGDGKFSEVSADKLVALGLF

DNMT3A 351 CSAFHQATYNKQPMYRKAIYEVLQVASSRAGKLFPVCHDSDESDTAKAVE
DNMT3B 284 SQHFNLATFNKLVSYRKAMYHALEKARVRAGKTFP.......SSPGDSLE

DNMT3A 401 VQNKPMIEWALGGFQHYGPKGLEP....PEEEKNPYKEVYTDMWVE....
DNMT3B 327 DQLKPMLEWAHGGFKPTGIEGLKPNNTQPVVNKSKVRRAGSRKLESRKYE

DNMT3A 443 PEAAAYAPPPPAKKPRKSTAEKPKVKEIIDERTRERLVYEVRQ
DNMT3B 377 NKTRRRTADDSATSDYCPAPKRLKTNCYNNGKDRGDEDQSREQMASDVAN

FIG. 3B-1
DNMT3A 486 KCRNIEDICISCGSLNVTLEHPLFVGGMCQNCKNCFLECAYQYDDDGYQS
DNMT3B 427 NKSSLEDGCLSCGRKNPVSFHPLFEGGLCQTCRDRFLELFYMYDDDGYQS

DNMT3A 536 YCTICCGGREVLMCGNNNCCRCFCVECVDLLVGPGAAQAAIKEDPWNCYM
DNMT3B 477 YCTVCCEGRELLLCSNTSCCRCFCVECLEVLVGTGTAAEAKLQEPWSCYM

DNMT3A 586 CGHKGTYGLLRRRKDWPSRLQMFFANNHDQEFDPPKVYPPVPAEKRKPIR